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METHOD OF ASSOCIATING MULTI -LINGUAL AUDIO RECORDINGS WITH 
OBJECTS IN AN INTERNET PRESENTATION SYSTEM 



Technical field 

The present invention relates to Internet systems enabling to 
5 conduct conferences and presentations by means of charts or 
slides accompanied with audio recordings to audiences anywhere 
in the world and relates in particular to a method of 
associating multi-lingual audio-recordings with visual objects 
in an Internet presentation system. 

10 Background 

Today, audiences including thousands of participants in the 
world can attend conferences and presentations broadcast by 
the Internet network. During such a presentation, the 
participant can view visual objects such as charts, slides, 
15 images, graphics (generally in PowerPoint) and listen to 
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prerecorded audio recordings associated with each visual 
object . 
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In general, the speaker who is in charge of adding audio 
comments to the visual object, does not assemble the final 
deliverable contents but could, via access to the Internet web 
pages, invoke the assembly process. The assembly is normally 
done by a third party who has the skills in creating the 
synchronized objects or an automatic system. 

But there are many problems associated with creating a 
presentation having a synchronized audio recording to the 
visual objects. Thus, the speaker must have a ready access to 
equipment like a recording machine, a microphone or a PC with 
audio support to create the recording. An alternative is to go 
to a recording studio and to use facilities there, but this is 
not always immediately available and it is an expensive 
solution. If the user chooses to use a tape recorder, the 
third party which receives this media must have a similar 
equipment available to replay the recording. 

Associating the audio recordings with the objects is also a 
problem. Indeed, the objects to which the audio files are 
associated, normally follow some sequential numbering scheme. 
However, there may not be sequential audio recordings due to 
some objects not requiring audio or the objects are not 
sequential. The speaker who is the person defining which 
objects require audio has to provide explicit documentation on 
the association. Such a manipulation being manually performed, 
the process can be subject to errors. 



30 



Ensuring that each audio recording is correctly associated 
with the visual objects requires that the people in charge of 
the association must be familiar with the language used or 
have very explicit instructions in their native language to 
correctly associate the audio recordings with the objects. The 
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third party may be associating more than one language to a 
single object. If the recording is not pre-edited into 
segments, it requires editing and therefore an expertise in 
the language spoken by the speaker is required. 

5 Summary of the invention 

Accordingly, the main object of the invention is to provide an 
Internet presentation system enabling the user to make audio 
recordings associated respectively with the visual objects of 
the presentation whatever the language used in the audio 
10 recording. 

The invention relates therefore to a method of associating 
multi-lingual audio recordings with visual objects in a 
presentation system accessed by the Internet network wherein a 
plurality of visual objects such as charts or slides 

15 controlled by a third party in a server can be accessed by any 
user of the Internet network, the visual objects being 
associated each with an audio recording in any language 
selected amongst several predetermined languages. Such a 
method comprises the steps of calling the third party audio 

20 recording server by the user in charge of the audio recording 
associated with a visual object via the Public Switched 
Network, prompting the user to enter his user ID (which will 
determine the language of subsequent voice prompts) ; and 
transmitting by the keypad of the telephone a user ID 

25 including a first part defining a first language to be used 
for recording the audio recording. 

Then, the user can enter his password and proceed to make 
audio recordings in the chosen language. He selects the chart 
or object he wants to make a recording for and makes a 
30 recording. He can replay that recording at anytime and 
re-record the audio if not satisfied. On completion of the 
recordings, the user then initiates an automatic update 
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process which automatically synchronizes the audio recordings 
to the visual objects. 



Brief description of the drawings 

The above and other objects, features and advantages of the 
5 invention will be better understood by reading the following 
more particular description of the invention in conjunction 
with the accompanying drawings wherein : 
■Fig. 1 is a block-diagram representing the general context 
wherein the invention is implemented. 
10 "Fig. 2 is a block-diagram of the functional means used by 

the third party to implement the invention 
■ Fig. 3 is a schematic diagram showing the recording of two 
audio files in English and in French by the system of the 
invention . 

15 « Fig. 4 represents a flow chart of the different steps used 

in the method according to the invention. 

Detailed description of the invention 

Assuming that the third party responsible for the presentation 
system has got the charts or slides to be presented to 
20 customers connected to the Internet network, a speaker (called 
the user in the following) has to record audio files each 
being associated with each chart or slide. 

As illustrated in Fig. 1, the user has at his disposal a 
workstation 10 connected to the Internet network 12 and a 
25 telephone set 14 connected to the Public Switched Network 
(PSN) 16. The third party server 18 including the charts 
slides or objects in general which can be broadcast to 
customers by the Internet network 12 is also connected to PSN 
16. 
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Prior to using the presentation system, the user is provided 
with a user identification number ID, a password and a 
telephone number (via web pages, e-mail or conversation with 
the third party) . When the user wants to record the audio 
5 recordings, he calls the third party server 18. At this time, 
the user is prompted to enter his ID by means of the keypad on 
his telephone 14. Note that this voice prompt can be in 
multiple languages. As illustrated by the block-diagram of 
Fig. 2, the ID 2 0 is processed in the server by a processing 
10 unit 22 which represents the sophisticated facilities 
{hardware and software) of the server to control and manage 
the system. 

The ID entered by the user determines the language to be used, 
the naming convention for the file and the location for the 

15 stored recordings. In general the third party has stored the 
objects for which audio is to be recorded in the same location 
as that specified by the ID in a directory 23, or the user 
through an automated process will place the objects into the 
location specified within the ID. Thus, for example, the ID 

20 may contain the number 6666 which is the directory name in 
which the final recordings are to be stored. 

On receiving the ID, processing unit 22 determines the 
language in which it must prompt the user and the file naming 
convention it must use to store the audio recordings. For 

25 example, the ID could be 366661 meaning that the directory 
number is 6666 and the language code is 31. Note that the 
number 31 could be placed at the beginning or at the end of ID 
entered by the user. With this language code, the processing 
unit 22 determines in a language table 24, which is the 

30 language will be used to prompt the user and the file naming 
language used for the recording . 

When processing unit 22 has determined the language to be used 
and after the user has entered the number of the visual object 
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associated with the audio recording, it generates a file 
number 2 6 to designate the recording file to be stored in a 
data base 28. In the example, such a file number could be 
6666:engxx wherein 6666 is the directory name, eng corresponds 
5 to the language "English" corresponding to language code 31 in 
table 24 and xx is the number of the chart or slide with which 
the recording is to be associated. 

After that, the user can record the audio recording which is 
stored in data base 2 8 in a file having the file number 2 6 as 
10 defined above. The processing unit also writes a flag (one 
bit) at the end of the file for indicating that it is a new 
recording which will need to be copied in directory 23 . The 
copying operation is achieved by the processing unit when the 
user requests an "update" as explained below. 

15 Referring to Fig. 3, it is shown how the language code 
included in the ID entered by the user is an essential feature 
of the invention. As a matter of fact, with a first language 
code, the processing unit determines in the language table 
that English has to be used for the recording. Using the file 

20 naming convention, the processing unit defines a file number 
32 for example 6666:engxx. This number is used to record the 
English recording 34 in a first location of the data base 28. 
But, assuming that French is now the language to associate 
with the same chart or slide, a different file number 36, for 

25 example 6666:frexx, is defined by the processing unit for the 
new recording. Then, this number will be used as file number 
to record the French recording 38, and to store this file into 
another location of data base 28. Whatever language is used, 
the different recordings corresponding to the same 

30 presentation will be then copied in directory 2 3 at the 
location defined by the same directory name, 666 6 in the 
example . 
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Now, the process according to the invention is described in 
reference to Fig. 4. After inputting the system (step 40) by 
calling the third party via the Public Switched Network, the 
user transmits his user ID (step 42) . As explained above, the 
5 processing unit of the server determines the language to be 
used (step 44) and the directory in which the recording has to 
be copied (step 46) . Then, the user has to enter a password 
(step 48) which is used to determine whether he is a valid 
user of the ID. 

10 After that, the user has to decide whether he wants to 
"update" that is to copy recordings already stored from the 
data base to the directory determined in the ID (step 50). If 
so, the user is disconnected (step 52) . Such a disconnection 
is necessary since, during the copying process which takes a 

15 short time (about 3 0-45 seconds) the user cannot make any 
changes to the recordings. If the user calls back immediately, 
it can be expected that, by the time he has entered the ID and 
the password, the copying process will be completed. Note that 
the recordings have a flag which is set to 1 when they are 

20 recorded, are reset to 0 after they are copied into the 
directory so that only the recordings having a flag set to 1 
are copied which reduces considerably the work load on the 
system. 

Then, it is determined whether the user has a chart to be 
25 recorded (step 54) . If not, the connection was only to update 
the existing recordings and the process is exit (step 56) . If 
there is at least a chart to be audio recorded, the user 
enters the chart number (step 58) . It must be noted that the 
message to be entered is generally split into 3 parts wherein 
30 the chart number can be part 2 followed by instructions or the 
instructions can be in part 1 and part 3 is a blank recording, 
depending upon the language which is used. 
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Generally, at this stage, the user is prompted by the system 
to know whether he wants to listen to a previous recording or 
to have a continuous playback of the already recorded audio 
files (step 60) . The feature of continuous playback is to 
5 allow the user to hear if there is a continuity between the 
recordings. This is a valuable productivity aid for the user 
since he does not have to select the next chart and more 
valuable when there are multiple recordings since it is a way 
of easily finding missing recordings in a sequence of 
10 recordings. In the latter case, the system states that no 
recording exists and can prompt the user if he wants to record 
one . 

Then, it is determined whether a recording exists for the 
chart number entered by the user (step 62 ) . If not, the user 

15 is prompted to start the audio recording (step 64) which will 
be stored in the data base with the file number defined from 
the ID as previously explained. Again, the user may be 
requested to listen to the recording or to playback all the 
recordings (step 66) . As the user may be not entirely 

20 satisfied with the recording, it is determined whether he 
wants to make a re-recording (step 68) . If not, the process is 
looped to the beginning before the update (step 50) . 

Assuming that there is already a recording corresponding to 
the chart number entered by the user, the same way as 

25 previously, it is determined whether the user wants to make a 
re-recording (step 70) . If it is the case or if after 
recording the audio file, the user is not satisfied (see step 
68), the user may start to re-recording (step 72). Again, the 
user may generally listen to his re-recording or make a 

30 continuous playback of the recordings (step 74) . Of course, 
when the user does not want a new recording after the first 
recording, he does not start re-recording and this step is 
jumped. Then, the process loops to the beginning before the 
update (step 50) . 
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It must be noted that the user can all the time have access to 
the system for returning to the "update" step or change any 
one of the recordings by calling again the third party, 
entering the ID and then the password. For practical purposes, 
5 there could be a time limit set on access time. 

When a recording is changed afterwards, the flag is also set 
to 1 in the audio file so that only this recording will be 
copied from the data base to the directory. 

When recording an audio file for a chart or slide, the 
10 synchronization between this one and the audio recording is 
automatically achieved. When the chart or slide will be 
displayed to a presentation participant, it will be displayed 
during all the time of the audio recording. In other words, 
the same chart is synchronized with the recordings 
15 corresponding to the different languages. However, the 
presentation has always the option to stop the audio playing 
or to go quickly to the next chart . 

Although the system is used today for producing an audio 
recording that is associated with a slide or chart, it is not 

20 limited to producing audio recordings for only this purpose. 

Many objects (slides, text, html pages, animation etc.), today 
require narration or audio, the addition of audio to one of 
these objects, to enhance the end user experience by being 
able to see the objects and at the same time have a supportive 

25 audio track. So, as technology changes, then this equipment 
can be used in support of newspaper articles, where you hear 
the reader etc. Also, as the web moves to encompass those 
users with physical disabilities, there is a need to produce 
audio versions of visible text or objects. The system could be 

30 used to record these. Similarly, the final delivery medium may 
not be the Internet, it could be a CD, diskette, be stored on 
a server and accessed by other means . 
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CLAIMS 

1. Method of associating multi-lingual audio recordings with 
visual objects in a presentation system accessed by the 
Internet network (12) wherein a plurality of visual objects 

5 such as charts or slides controlled by a third party in a 
server (18) can be accessed by any user of said Internet 
network, said visual objects being associated each with an 
audio recording in any language selected amongst several 
predetermined languages ; 
10 said method comprising the steps of: 

calling said third party server by the user in charge of 
the audio recording associated with a visual object via the 
Public Switched Network (16) , 

prompting said user on his telephone (14) to enter his 
15 user ID; and 

transmitting by the keypad of said telephone said user ID 
including a first part defining a first language to be used 
for recording said audio recording, 

whereby the displaying of said visual object by said 
20 workstation will be automatically synchronized with said audio 
recording. 

2. Method according to claim 1, further comprising the step 
of entering the number of said visual object associated with 
said audio recording by the user at said telephone (14) after 

25 he has transmitted said user ID. 

3. Method according to claim 1 or 2, further comprising the 
step of transmitting a password by the user at said telephone 
(14) after this one has transmitted said user ID. 

4. Method according to claim 3, wherein said user ID 
30 includes a second part defining a directory name wherein the 

file containing said audio recording is stored. 
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5. Method according to claim 4, further comprising the step 
of generating by said third party server a first file name 
including said directory name and a code defining said first 
language when ID is received by said third party server (18) . 

5 6. Method according to claim 5, wherein said audio recording 
in said first language is stored in a data base (28) by using 
said first file name. 

7. Method according to claim 6, further comprising the steps 
of transmitting from said telephone (14) another ID containing 
10 a first part defining a second language to be used for 
recording said audio recording and a second part defining said 
directory name, and generating by said third party server 
another file name including said directory name and a code 
defining said second language. 

15 8. Method according to claim 6 or 7 , further comprising the 
step of copying said audio recording corresponding to said 
first file name from said data base (28) to a directory (23) 
at said directory name, said step of copying being activated 
by the user at said telephone (14) . 

20 9. Method according to claim 8, wherein the file containing 
said audio recording includes a flag which is set when said 
audio recording is recorded for the first time or changed and 
which is reset when said file is copied from said data base 
(28) to said directory (23) . 

25 10. Presentation system accessed by the Internet network (12) 
wherein a plurality of visual objects such as charts or slides 
controlled by a third party in a server (18) can be accessed 
by any user of said Internet network, said visual objects 
being associated each with an audio recording in any language 

30 selected amongst several predetermined languages ; 
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said system comprising a third party server including 
processing means (22) for receiving a user ID from a user 
telephone (14) , said user ID comprising a first part defining 
a first language to be used for recording said audio recording 
5 whereby the displaying of said visual object by said 
workstation will be automatically synchronized with said audio 
recording . 

11. System according to claim 10, wherein said third party 
server (18) comprises a table (24) for determining said 

10 language to be used for recording said audio recording from 
said first part in said user ID. 

12. System according to claim 11, wherein said user ID 
includes a second part defining a directory name, wherein the 
file containing said audio recording is stored, and further 

15 comprising a directory (23) to store said audio recording at a 
location determined by said directory name. 

13. System according to claim 11, further comprising a data 
base (28) wherein said file containing said audio recording is 
stored after being recorded. 

20 14. System according to claim 12, wherein said file 
containing said audio recording is copied in said directory 
(23) after activation of said processing means (22) by the 
user at said telephone (14) . 
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METHOD OF ASSOCIATING MULTI- LINGUAL AUDIO RECORDINGS WITH 
OBJECTS IN AN INTERNET PRESENTATION SYSTEM 



Abstract 

Method of associating multi- lingual audio recordings with 
5 visual objects in a presentation system accessed by the 
Internet network (12) wherein a plurality of visual objects 
such as charts or slides controlled by a third party in a 
server (18) can be accessed by any user of the network, the 
visual objects being associated each with an audio recording in 

10 any language selected amongst several predetermined languages. 

This method comprises the steps of calling the third party 
server by the user in charge of the audio recording associated 
with a visual object via the Public Switched Network (16) , 
prompting the user on his telephone (14) to enter his user ID; 

15 and transmitting by the keypad of the telephone the user ID 

including a first part defining a first language to be used for 
recording the recording, whereby the displaying of the visual 
object will be automatically synchronized with the audio 
recording . 

20 FIG. 1 
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